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ABSTRACT 

In this paper, we explore the constraints on the reionization history that are provided 
by current observations of the Lya forest and the CMB. Rather than using a partic- 
ular semi-analytic model, we take the novel approach of parametrizing the ionizing 
sources with arbitrary functions, and perform likelihood analyses to constrain possi- 
ble reionization histories. We find model independent conclusions that reionization is 
likely to be mostly complete by z = 8 and that the IGM was 50% ionized at z = 9 — 10. 
Upcoming low-frequency observations of the redshifted 21 cm line of neutral hydrogen 
are expected to place significantly better constraints on the hydrogen neutral frac- 
tion at 6 < z < 12. We use our constraints on the reionization history to predict the 
likely amplitude of the 21 cm power spectrum and show that observations with the 
highest signal-to-noise ratio will most likely be made at frequencies corresponding to 
z — 9 — 10. This result provides an important guide to the upcoming 21 cm observa- 
tions. Finally, we assess the impact that measurement of the neutral fraction will have 
on our knowledge of reionization and the early source population. Our results show 
that a single measurement of the neutral fraction mid-way through the reionization 
era will significantly enhance our knowledge of the entire reionization history. 

Key words: cosmology: diffuse radiation - cosmology: theory - galaxies: high-redshift- 
intergalactic medium 



1 INTRODUCTION 

Determining how and when the Universe was reionized is one 
of the major outstanding questions of modern cosmology. 
Observations of the cosmic microwave background confirm 
that hydrogen gas first became neutral at z ~ 1100 around 
400,000 years after the Big Bang. Observations of absorp- 
tion spectra of quasars at redshifts z < 6 indicate that the 
intergalactic medium (IGM) is largely ionized. Therefore a 
cosmic phase transition occurred whereby the IGM went 
from bein g fully neutral to being almost fully ionized (for a 
review see lBarkana fc Loebll200lh . Our picture of when ex- 
actly this phase transition occurred and what was the nature 
of the sources that drove it remains cloudy. 

In recent years, improved observational data has be- 
gun to probe the reionization era. By pursuing observations 
of the Lya forest in QSO spectra at ever higher r edshifts, 
Gunn -Peterson troughs |Gunn fc Peterson! Il965l : IScheuerl 
1 19651) have been see n along several quasar sight lines at z > 6 
( Becker et al.|[200ll ). providing an indication of an increase 
in the IGM neutral fraction at these redshifts. Interpreta- 



tion of these observations is subtle, since the IGM ioniza- 
tion state depends upon its thermal history, the spectrum 
of the ionizing sources, and the history of the production of 
ionizing radiation in those sources. 

Complementing Lya forest observations, is the detec- 
tion of the imprint of reionization in the polarization of the 
cosmic microwave background (CMB). Rescattering of CMB 
photons by the ionized IGM produces a distinctive "bump" 
in the CMB power spectrum o n large angular scale s that 
has been detected by WMAP i|Dunklev et alj l2009h . This 
places a constraint on the integrated optical depth tcmb to 
the surface of last scattering, and suggests that a signifi- 
cant fraction of the IGM was ionized by z ~ 10. In a few 
years time, results from the Planck satellitfQ are expected 
to tighten these constraints considerably. 

In addition to these relatively robust observations, a 
host of other observations contribute to our understand- 
ing of the state of the IGM at high redshift. These in- 
clude constraints o n the temperature of the IGM i n the 
range z = 2 - 4 (|Schave et all l200d : iRicotti et all l200d : 
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McDonald et alj l200ll ; IZaldarriaga et all l200ll ; ICen et all 
20091 ). These observations are challenging and their inter- 



pretation is complicated by the likelihood that Hell reion- 
ization is occurring at z ~ 3. Other observations include 
the e volution of the abundance and clustering of Lya emit- 
ters (iMalhotra fe Rhoads|l2004l) and of the evolution of the 
metallicitv (jBecker et alj|2009h . These observations are dif- 
ficult to interpret and as of yet offer only hints rather than 
definite evidence for how reionization proceeds. There also 
exist con straints on the number density of galax ies at high 
redshift jBouwens et al.ll2005l ; IStark et al.ll2007l ), although 
extrapolating from the few galaxies observed to their impli- 
cations on the ionization history is difficult due to the small 
number of candidate high redshift galaxies and the unknown 
escape fraction of ionizing photons. 

In the near future, it is hoped that observations of 
the redshifted 21 cm line of neutral hydrogen will pro- 
vide a direct probe of the state of the IGM at redshifts 
z = 6 — 15. Low frequency interferometers such as LO- 
FAE0, MWA0, PAPEE0, and SKA0 should, in principle, be 
able to map the ionization state of the IGM giving direct 
measurements of the ionized fraction Xi at several different 
redshifts. It is also hoped that future observations of high- 
redshift gamma ray bursts (GRB) will illu minate the state 
of the Lya forest at yet h igher redshifts ijBromm fc Loeb! 
120061 ; iMcQuinn et al . 2008) and the recent observation of a 
z « 8 GRB ( Tanvir et al. 2009h seems cause for optimism. 

On the theoretical side, numerical simulations offer one 
way of trying to reconcile these different observations and 
a variety of groups have ma de concrete progress in this 
area (for a recent review see iTrac fc GnedirJl2009h . Ana- 
lytic modelling is also possible and provides a useful way 
of exploring a large parameter space in order to assess how 
different sources of information const r ain re ionization. For 
example, IChoudhurv fc Ferraral (|2005l . I 2006) attempt to si- 
multaneously model many of the above observations with a 
single model in a self-consistent way. Despite progress, there 
is considerable uncertainty in the theoretical modelling of 
reionization, partly stemming from uncertainty in the rele- 
vant parameters and partly from the complexity of the in- 
teraction between different physical processes. 

In this paper, we combine these different pieces of obser- 
vational evidence in order to quantify our uncertainty on the 
reionization history. Given the numerous sources of astro- 
physical uncertainty, we argue that it is presumptive to claim 
that we can use analytic models to definitively constrain 
reionization in detail (e.g. Choudhury & Ferrara 2005). In- 
stead we propose arbitrary forms for the evolution of ionizing 
photon production. Using these arbitrary forms, we define 
the space of plausible models that fit the current data. We 
focus on the Lya forest and CMB observations which most 
tightly constrain reionization, and which are most readily 
predicted by an analytic model. We use a likelihood based 
analysis to place constraints on the ionization history, con- 
sidering two different source modelling parametrizations, in 
order to assess the systematic uncertainty in our conclusions. 
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Figure 1. Evolution of WMAP 1 — a constraints on the optical 
depth tcmb and, for comparison, the predicted error for Planck. 



Having made explicit the bounds of our ignorance in the 
ionization history, we explore the implications for the signal- 
to- noise ratio (S/N) that future 21 cm experiments might 
achieve at different redshifts. Finally, we turn this problem 
around and ask what 21 cm experiments will tell us about 
the reionization history that was not already implicit in our 
existing observations. 

The layout of this paper is as follows. In jj2]we discuss 
the current observational constraints on reionization that we 
will be using in our analysis. We then discuss in Sj3]the for- 
malism for using these constraints to infer the ionization his- 
tory given a particular model for reionization. In Sj4] we use 
this formalism to place constraints on model parameters and 
calculate probability distributions for the neutral fraction in 
different redshift bins. This is then used to make predictions 
for upcoming 21 cm instruments. Finally, we conclude in §5] 
The explicit details of our model of reionization are left to 
appendix [X] 

Throughout this paper, we assume a cosmology with 
VL m = 0.3, Q.A = 0.7, ft b = 0.046, H = 100/ikms -1 Mpc" 1 
(with h = 0.7), ns = 0.95, and as = 0-8, co nsistent with the 
latest measurements (|Komatsu et al.ll2009h . 



2 OBSERVATIONAL CONSTRAINTS ON 
REIONIZATION 

We now turn to the existing observational constraints on 
reionization. Constraints on reionization fall into two main 
types: those, like tcmb, that depend solely upon the ion- 
ization history Xi(z) and those, like the Lya forest, that 
constrain the sources and ionizing background that drives 
reionization. In this section, we discuss existing and future 
constraints and specify the data sets that we will use for our 
inference. We defer the details of our model of reionization 
to Appendix [A] 

Perhaps the most robust constraint on reionization 
comes from WMAP measurements of the optical depth to 
the surface of last scattering. Figure [1] graphically illus- 
trates the evolution of measurements of tcmb • Follow- 
ing the inclusion of improved polarisation measurements in 
WMAP3, the best fit value appears to have stabilised with 
the W MAP5 value t C mb = 0.087 ± 0.017 (jPunklev et all 
2009). Planck, which will have better polarisation sensitivity, 
should improve this significantly with Fisher matrix calcu- 
lations predicting con straints at the level of oy = 0.005 (e.g. 
IColombo et alj [20081 ). although this may degrade to a T — 
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0.01 once foreground modeling is included l|Tegmark et al.l 
l2000h . 

Since observations suggest reionization is essentially 
complete by z ~ 6.5, only a fraction of tcmb requires further 
explanation. Assuming hydrogen and Hel are fully ionized 
by z = 6.5 and that Hell reionization occurs at z — 3 gives 
tcmb = 0.044. This leaves about half of the observed optical 
depth At w 0.043 to be explained by ionization at higher 
redshifts. 

In the instantaneou s model used by the WMAP5 analy- 
sis (|Dunklev et alj|2009f ) , the observed tcmb corresponds to 
a reionization redshift of z r i = 11 ± 1.4. For more extende d 
models of reionization, such as those of iFurlanettol i|2006l ). 
full reionization occurs later, but with a significant tail to 
higher redshifts. Since tcmb is an integral over the ioniza- 
tion history, the optical depth constraint essentially gives 
us a constraint on the duration of reionization or equiv- 
alently the redshift at which star formation began. With 
much better data, one could imagine finding inconsisten- 
cies between tcmb and direct measurements of the reion- 
ization history indicating that other processes were respon- 
sible for modifying the ionization history at high redshift 
jChen fc Kamionkowskill2004l ; lMack fc Weslevll2008l ) . In this 
paper, we assume that all of tcmb arises from reionization by 
astrophysical sources and use the reported WMAP5 tcmb 
value. 

In principle, CMB observations are sensitive to more 
than just tcmb since the shape of the reionization history 
will modify the pol arization angular power spectrum on an- 
gular scales £ < 50 jKaplinghat et aLll2003l ). Unfortunately, 
realistic reionization histories lead to only small modifica- 
tions that are har d to detect with ev en cosmic variance lim- 
ited observations dHolder et al ]|2003l ). It is therefore unlikely 
that CMB missions will provide much inf ormation on reion- 
izatio n beyond that contained in tcmb (Zaldarriag a et all 

iH). 

Turning now to astrophysical constraints, we begin with 
observations of the Lya forest in quasar absorption spec- 
tra. These place constraints on the ionizing background 
produced by luminous sources. The measured quantity is 
the mean transmittance of Lya flux (F) along the line of 
site to a quasar or equivalently the effective optical depth 
T eff = -log(.F) (fSchave et al.ll2003l ; ISongailall2004l ; iFan et all 
120061 ; IFaucher-Giguere et al.l 120081 ). With knowledge of the 
physical properties of the IGM, r e ff can be connected to 
the background photoionization rate per hydrogen atom 
T = r_i2 x 10 _12 s _1 . This quantity serves as a constraint 
on the sources of ionizing radiation that drive reionization. 

However, the conversion of a measured value of r e g 
into F_i2 is a non-trivial process requirin g the use of 
hydr o -dynamical simulations of the IGM ( Bolton et al.l 
120051 ; iBolton fc Haehnerd l2007h or the use of the fluctuat- 
ing Gunn-Peterson approxima tion (FGPA) l|Fan et al.ll2006l ; 
IFaucher-Giguere et al.l [200 83;). It is further plagued by our 
poor understanding of the relationship between overdensity 
A and temperature Tigm in the IGM. This is traditionally 
modelled at a power law T = Tp A , the asymptotic f orm af- 
ter reionization has taken place (|Hui fc Haim an 2003). How- 
ever, recent simulations have shown that the situation at 
z ~ 3 may be signifi cantly more compl i cated as a result 
of Hell reionization l|Bolton et al.l 120081 ; iTrac et all 120081 ; 
iMcQ uinn et all l2009t ) with there being some evidence for 



Table 1. Constraints on r e fj used in our analysis (taken from 
IBolton &: Haeh neltJ J2007T) a nd based on data from lSchave et all 
[|2003h . lSongairal <2004D . and lFan et al.l ll2006h ). 



T c ff 



ff 



3 0.362 

4 0.805 

5 2.07 

6 > 5.50 



0.035 
0.067 
0.27 



a temperature inversion. At z > 4, there are curren tly no 
direct measurements of this relation (|Cen et al.ll2009l ). 

Since T is a mostly local measurement, it primarily con- 
strains the sources at the redshift at which it is measured. 
In this paper, we assume that our model of sources should 
consistently drive reionization and also match observat ions 
shortly after reionization has completed (|Meiksinll2005l ). It 
is important to remember that ioniz ing photons may come 
from q uasars in addition to stars. IFaucher-Giguere et al.l 
l|2008al ) show that, whereas the contribution from qu asars 
drops rapidly at redshift z > 4 (Ho pkins et al. 2007), the 
contribution from galaxies remains fairly constant, suggest- 
ing that galaxies are the dominant source of ionizing photons 
during reionization. Since it is these sources that we will be 
interested in constraining, we will consider only F_i2 con- 
straints at z > 4. 

The constraints on T e g that we use are sh own in Ta- 
ble [T] and taken from IBolton fc Haehneltl (|2007| . henceforth 
BH07). Importantly, these contain invaluable data points at 
z — 5 and z — 6, whi le other authors have focussed o n z < 4. 
The constra ints from lFaucher-Giguere et al.l l|2008al ) (hence- 
forth|FG0j), which utilise a larger quasar sample but extend 
only to 2 = 4.2, are consistent with these, although the lBH07l 
points are slightly lower at z ~ 4 (proba bly as a result of 
not correcting for "continuum bias" . See |fG08 for discus- 
sion). These two authors quote constraints on P_i2 that are 
systematically offset from one another, mostly as a result 
of different assumptions about the uncertain temperature- 
density relation. We therefore choose to work from r e s and 
not r_i2 , so as to have control of the assumptions that are 
included. Our model for converting T e g into T_i2 is detailed 
in CAT1 

As alluded to above, a knowledge of the temperature- 
density relationship of the IGM is an important ingredi- 
ent in converting T g observations into P_i2 constraints. 
The IGM temperature can also potentially constrain when 
reionization took place. After reionization, adiabatic cooling 
rapidly brings the IGM temperature-de nsity relation to an 
asymptotic form (|Hui fc Haimanl feoOS), Observing an ex- 
cess temperature over this might indicate that reionization 
had occurred recently, so that not enough time to reach the 
asymptote had elapsed. Measurements of the IGM temper- 
ature come from Ly a forest observations at z < 4 from fit- 
ting to line widths dSchave et all l2000t iRicotti et all l2000l : 
iMcDonald et al]|200ll ) or from measurements of the small 
scale flux power spectrum l|Zaldarriaga et al1l200ll ). The pic- 
ture arising from these measurements is unclear and when 
the temperature-density relationship is parametrized as a 
power-law Tigm = ToA 13 the parameters are only loosely 
constrained in the range To ~ (1 — 3) x 10 4 K and /3 w 0—0.8, 
depending on the technique and data set. Given the current 



© 0000 RAS, MNRAS 000, 000-000 



4 Jonathan R. Pritchard, Avi Loeb, and Stuart Wyithe 



level of the data and the importance of Hell reionization, we 
choose not to include temperature constraints in our analy- 
sis and only use these observations to set a reasonable prior 
on all owed values of To and f3 (see IChoudhurv fc Ferraral 
for an alternative approach). 
Constraints on T_i2 may be converted into constraints 
on the rate per unit volume at which sources produce ioniz- 
ing photons JVion, given assumptions about the source spec- 
trum, the mean free path of ionizing photons, and the dis- 
tribution of absorbing systems. These properties are poorly 
constrained by direct observation, as is discussed in more 
detail in CA2l 

Turning now to the future, more direct constraints on 
the ionization history may be possible with future 21 cm 
observations. Experiments such as MWA, LOFAR, PAPER, 
and SKA will observe the 21 cm power spectrum P2i(k) over 
a range of wavenumber k in different redshift bins. Analytic 
models and simulations suggest that the shape of P2i(k) on 
scales k ~ 0.1 — 1 Mpc - will provide good constraints on 
the bubble fillin g fraction Xj(z) in each redshift bin. Fisher 
matrix analyses (jBowman et al. I l200rj ; iMcQuinn et al.ll2006ft 
suggest that MWA will be able to place constraints on xh 
in a fully neutral IGM at z — 8 at the a Xi = 0.2 level, while 
SKA should be capable of placing constraints of a Xi — 0.07. 
Adding in Planck CMB data to break degeneracies with cos- 
mological parameters improves these constraints by a factor 
of ~ 2. Similarly, by lo oking at th e evolu tion of the am- 
plitude and slope of fii . iLidz etaD (|2007h find that MWA 
could constrain the midpoint of reionization with accuracy 
Xi = 0.5 ± 0.05. 21 cm experiments may also give informa- 
tion about astrophysical parameters such as the mi nimum 
mass in which the first galaxies form (|Barkanall2008T ). 

The success of these instruments is dependent upon 
reionization occurring within their sensitivity window. These 
experiments are initially only likely to be sensitive to the 
power spectrum for frequencies above 100 MHz (although 
they are designed to cover frequencies down to 80 MHz), 
which corresponds to redshifts z < 13. At higher redshifts, 
synchrotron emission from the galaxy becomes extreme, ter- 
restrial radio interference becomes more problematic, and 
the Earth's ionosphere is more of a challenge. It is therefore 
important to gauge what the ionization history is doing in 
this range of redshifts. 

A second use of the 21 cm line comes from l ooking at the 
evolu tion of the global brightness temperature (jShaver et alj 
1 1999ft. rather its fluct uations. Experiments such as EDGES 
l|Bowman et al.l [20071 ). which look for a sharp transition in 
the 21 cm signal brought about by reionization essentially 
constrain dTb/dz and so dxi/dz. Beating experimental sys- 
tematics down by a factor of 10 or so should enable these ex- 
periments to begin placing meaningful constraints on Xi(z). 
Again, it is important to ascertain the amplitude of the sig- 
nal that these experiments may hope to see. 



3 INFERENCE OF IONIZATION HISTORY 

We next summarise our methodology for inferring the ion- 
ization h i story. For a summary of Bayesian inference see e.g. 
iMacKavl 42003ft . 

We wish to attempt predictions for 21 cm observations 
given observational constraints from the Lya forest and the 



CMB. To do this we make use of Bayes theorem 
p{D\w,M)p(w\M) 



p(w\D,M) 



p(D\M) 



(1) 



where M is a model for reionization with parameters w, 
and D is the combination of observational constraints on 
Mon and t cmb • The evidence p(D\M) provides an overall 
normalization for the posterior probability p(w\D, M). We 
must specify our prior p(w\M) on the space of model param- 
eters. Where measurements are poor, we take flat priors over 
a specified range for each parameter, thus p(w\M) — const. 
Our choice of prior should only be important if the data 
only weakly constrains the ionization history. With these 
assumptions the posterior probability is proportional to the 
likelihood p(D\w, M), which is straightforward to calculate. 
Note that we will make the simplifying assumption of Gaus- 
sian errors in the observational constraints on r e ff and tcmb 
when calculating the likelihood, although this is by no means 
guaranteed. 

The first step of our inference procedure will be to use 
Eq. ((TJ) and the data to constrain the model parameters for 
a set of different parametrizations. Then, since each model 
provides a definite prediction for Xi(z), we can calculate the 
probability distribution for Xi at a given redshift from 



p(xi\M) = I Awp(w\D,M)5[xi(w\M) - Xi 



(2) 



This gives us a bound on the ionization history given our 
choice of the form of the source model. The same procedure 
can be applied to calculate bounds on other quantities such 
as &Xi/dz or the S/N for a 21 cm experiment. 

The evolution of sources is likely to be complex and 
to be resistant to description by a small set of numbers. 
However, by looking at a handful of models and seeing if 
the predictions are relatively consistent, we can still hope to 
obtain meaningful predictions. 



4 RESULTS 

4.1 Constraints on JVion 

We begin by mapping the constraints in r e ff into con- 
straints on r_ i2 and JVion- This has been the subject of ex- 
tensive work dBolton et al.l 120051 : iBolton fc Haehneltl [20071 ; 
iFaucher-Giguere et al. 2008aft . "with each author making 
their own preferred choice of assumptions about uncertain 
astrophysical parameters. In order to control the assump- 
tions used and to preserve the effect of the non-linear map- 
ping of constraints, we perform our own analysis, but where 
possible we have checked that our model, which is described 
in detail in Appendix [X] reproduces the results of previous 
work. 

The main parameters in our model of reionization are 
the two parameters describing the temperature-density re- 
lation To and /3, the spectral index of the sources as, and 
the power law index of absorbing systems 7. We further 
allow for an uncertainty in the overall normalisation of the 
mean free path k. Alongside these astrophysical parameters, 
we allow for uncertainty in the normalisation of the matter 
power spectrum as, the total matter density Q m , the baryon 
density fit, and the Hubble constant h. 
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Table 2. Fiducial parameter choices 



Parameter 


(-^low) 


u x Ohigh) 


prior 


To 


0.5 x 10 4 K 


3.0 x 10 4 K 


uniform 


P 





0.6 


uniform 


as 


1 


3 


uniform 


7 


1 


2 


uniform 


K 


1 


0.2 


gaussian 


&8 


0.8 


0.05 


gaussian 




0.3 


0.04 


gaussian 




0.046 


0.0005 


gaussian 


h 


0.7 


0.04 


gaussian 





Figure 2. Likelihoods for T e g (top panel), T_i2 (middle panel), 
and 7Vi on (bottom panel, in units of 10 51 s _1 Mpc - 3 ). In each 
case, we plot the distribution for 2 = 4 (dashed curves), 2 = 5 
(dotted curves), and 2 = 6 (solid curves). 



Assuming the parameter priors in Table [5] we propa- 
gate uncertainties by randomly drawing specific realisations 
of the parameters from their respective distributions, calcu- 
lating the resulting T_i2 and iVi on and repeating in order 
to fully sample the final distributions. Figure [2] shows the 
resulting uncertainty on T_i2 and iVi on . Notice that in each 
case there is considerable uncertainty on each of the param- 
eters. 

Our constraints at each redshift are summarised in Ta- 



Table 3. Summary of constraints. For r_i2 and iVi on we show the 
most likely value and errors containing 68-% and 95-% confidence 
intervals. 



Redshift 



T"cff 



0.805 ± 0.067 
2.07 ±0.27 
5.5 - 15 



0.57 



+0.35 (+0.71) 
0.10 (-0.22) 
„ „„+0.39 (+0.83) 
u - do -0.06(-0.16) 
n n o+0.H(+0.25) 
u - uo -0.002 (-0.02) 



„ „ n +0.53(+l.l) 
u - ou -0.19 (-0.40) 

n sn+ - 53 < +1 - 2 ) 

u.ou_ Q 21 /_ 4 2) 

n /ts+ - 34 (+ - 75 ) 

u -^°-0.12 (-0.25) 



ble|3] where we list the most likely value as well as error bars 
containing 68-% and 95-% confidence intervals. Besides the 
basic astrophysical parameters, we allow for an uncertainty 
in the overall normalisation of the mean free path and for 
uncertainty in the cosmological parameters. A more detailed 
analysis would take into account the covariance between the 
observational uncertainties and these parameters, but that 
lies beyond the scope of our analysis. Note that the error 
bars are highly asymmetric, emphasising the need to con- 
sider the detailed probability distribution in our analysis. 
The top p anel of Fig ure [3] s hows these error bars alongside 
those from BH07 and lFG08l . Our error bars are deliberately 
quite conservative. 

Note that we treat the upper limit on t c r at z = 6 as a 
uniform prior over a wide range of t c h values. Increasing the 
upper limit here skews the constraint on N 10TL to lower values. 
Unfortunately, this is at the edge of observational capability. 
Improving these observations would go a long way towards 
constraining the evolution of the sources. 

Note that the iVjon constraints at z — 4 and z = 5 are 
essentially identical. This is an indication that the errors 
from parameters other than F_i2 dominate the error budget. 
Note that ou r cons trai nts on r_i2 are somewhat different 
from those of lFG08l and lBH07l since these are very sensitive 
to the temperature prior used. The overall normalization of 
the temperature is less important for the N[ on constraints 
since shifting the temperature changes the normalisation of 
F_i2, but the scaling of A m f p is such as to compensate in 
the calculation of Ai on (see Equation (|A7|l ). 



4.2 Modeling ionizing sources 

We next discuss our parametrization of the sources and con- 
sider two cases to explore the model dependence of our 
predictions. Since we are focussed on behaviour at z > 4, 
we neglect the known contribution of quasars, which is ex- 
pected to be small at these redshifts. The arbitrary nature 
of our parametrization of Ni on implicitly allows for a vari- 
ety of contributing sources especially galaxies and quasars, 
but coul d also include X-ray ion izations from early mini- 
quasars (jRicotti fc Ostrikerll2004l ) provided that these were 
unimportant for the Lya forest. 

The quantity JVi on is affected by two ingredients: the 
star formation rate and the number of ionizing photons per 
baryon in stars that escapes to ionize in the IGM. This moti- 
vates separating the two parts using the assumption that the 
star formation rate simply tracks the rate at which collapsed 
structures form. This leads to 



Mo 



•C(*W0) 



d/coii(z) 
dt ' 



(3) 



where we have factored out the unknown number of ioniz- 
ing photons per baryon, escape fraction, and star forming 
efficiency into a single ionizing efficiency £ = ATtjv /esc/*. 
We calculate / co ii assumi ng a Press-Schecter mass function 
jPress fc Schechterl Il974h and using the minimum galaxy 
mass from atomic hydrogen line cooling. Rather than at- 
tempt to model M on directly, we can then model in- 
stead. This has the advantage of separating out the expected 
rapid increase in the star formation rate, which results from 
the exponential increase of the collapse fraction. A physi- 
cally motivated model is one that interpolates between two 
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constant values 



C(*) = Co + 



(Ci - Co) 



tanh 



zo 



+ 1 



(4) 



2 L V Az 
This would result, for example, if an early period of Popula- 
tion III sta rs gave way to a later epoch of Population II star 
formation (jBromm fc Loebll20(56h . For this parametrization, 
we expect the iVj on constraints will fix the low z amplitude of 
£ while the optical depth constrains the high z contribution 
from the second population. Having four free parameters is 
really the minimum needed to allow for two populations of 
sources and so provide an interesting degree of flexibility in 
fitting the data. 

It might also be natural to model iVion directly as a poly- 
nomial with some redshift at which star formation switches 



iV A ion [l + Ni (z - zo) + N 2 (z- zo) 2 + N 3 (z- zof] 



where the normalisation A± a 



10 s 



x Q(z- 
s -1 Mpc" 



(5) 



and we vary z ma x, No, Ni, and N2 and adjust N3 so that 
Mon(«max) = 0. This allows considerable flexibility to fit 
the data, allowing for many qualitatively different ionization 
histories. In order to ensure that this parametrization does 
not lead to unphysically large values of 7Vi on we impose the 
weak prior that A^ on /Ai on < 10 at all redshifts. This only 
affects the results when we do not include the z = 6 Lya 
forest constraint. 

Even this brief discussion illustrates the problem of 
a suitable parametrization. With future observations, one 
could imagine more sophisticated modelling, but, as we shall 
show, current data is only sufficient to constrain two or three 
parameters making more detailed modelling premature. 

To illustrate some of the general features of our mod- 
eling, we show in Figure models with a constant value of 
£ = 30 and f = 45. This serves as a benchmark for illustrat- 
ing some of the more generic features of our model. The con- 
stant £ leads to Mon tracing the rate of collapse of structure, 
which peaks at z « 6 in our model and drops going to lower 
redshifts where the contribution from quasars is expected to 
take over and dominate. A lower value of £ leads to a delay 
in the ionization history, so that reionization completes at 
redshifts different by Az ps 1, and a lower Thomson optical 
depth to the CMB. In both cases, recombinations play little 
role until reionization is well underway. 

4.3 Ionization history 

We now use these constraints on Mon and tcmb to constrain 
our model parameters. To explore the effect of CMB data, 
we consider combinations of the Lya forest with WMAP3, 
WMAP5, and Planck. Although the central value of the 
observed tcmb appears constant between data sets, there 
is still the possibility of systematic errors at the level of 
ov CMB = 0.01 in the WMAP5 data. Relaxing the con- 
straint on t cmb gives a sense of how these systematic er- 
rors might change our results. We also consider constraints 
from WMAP5 in combination with only the z = 4 and 
2 = 5 Lya forest data, throwing out the z = 6 constraint. 
The 2 = 6 Lya forest is subject to considerable theoretical 
uncertainty since it requires extrapolating from the IGM 
properties observed at z = 4 the most. Moreover, in late 



T 1 1 1 1 I I I 




Figure 3. Models with constant Q(z) = 30 (solid curves) and 
f = 45 (dotted curves). We show the redshift evolution of JVi on 
(top panel, in units of 10 51 s — 1 Mpc -3 ), Xi (middle panel), and 
the clumping factor C (bottom panel). In the to p pane l, we show 
error b ars on N^ on calculated using the results of lBH07l (blue) and 
iFGOa (red - statistical only) alongside our constraints (black, see 
ED- 



reionization models 2 = 6 may occur close enough to over- 
lap that sp atial fluctuations in the temperature-density re- 
lationship llFurlanetto fc O hi 12 009) or ionizing background 
|Furlanetto fc Mesingerl I2009T ) may complicate the connec- 
tion between r e s and JV ioll . 

We calculate the likelihood for the parameters by uni- 
form sampling of the four dimensional parameter space. In 
each case, we assume a uniform prior on the source parame- 
ters. Although numerically somewhat inefficient this makes 
later analysis more straightforward. 

In addition to the CMB and Lya forest constraints, 
we make a cut throwing out all models with Xi(z = 6) < 
0.99. That the IGM is ionized to at least this level is well 
established by Lya forest observations and imposing this 
constraint ensures that HII regions have percolated placing 
us in the range of validity of our Lya forest modelling. 



4-3.1 £ parametrization 

Our physically motivated £ parametrization naturally de- 
scribes two epochs of star formation with differing efficien- 
cies. Figure [4] shows the marginalised probability distribu- 
tion function (PDF) for the four model parameters. We see 
that the Lya forest constraints on N; ori lead to a clear prefer- 
ence for £0 centered around C,o ~ 30 consistent w ith the ion- 
izing efficiency expected for Pop II stars (see e.g. iFurlanettol 
2006). In order to satisfy the tcmb constraint however a 
population of more efficient sources at higher redshifts is re- 
quired. This could arise, for example, from evolution in the 
escape fraction with redshift or from the contribution of a 
different population of sources at high redshift e.g. Popula- 
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Figure 4. Marginalised PDF for the f parametrization. Different 
combinations of data are plotted: Lya forest + WMAP5 (black), 
Lya forest (excluding z = 6) + WMAP5 (red), Lya forest + 
WMAP3 (blue), and Lya forest + Planck (black dashed). 



Figure 6. Marginalised PDF for the Ni on parametrization. Dif- 
ferent combinations of data are plotted: Lya forest + WMAP5 
(black), Lya forest (excluding 2 = 6)+ WMAP5 (red), Lya forest 
+ WMAP3 (blue), and Lya forest + Planck (black dashed). 
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Figure 5. Distribution of at redshifts z = 8, 9, 10, and 11 for 
the £ parametrization. Same curve styles as for Figure l4l 



tion III stars. The model prefers that this transition occur 
at 2 > 15, but there is little constraint on exactly when 
this transition needs to take place, since a higher transition 
redshift can be compensated for with a higher £i leading to 
a more extreme early burst of ionization. No preference for 
the width of the transition Az is shown. Including Planck 
type t cmb constraints significantly improves constraints on 
Co , Ci i an< A z o since it reduces the freedom to change the high 
redshift behaviour of the model. 



In Figure [5] we show the marginalised distribution of x% 
in four redshift bins of interest to upcoming 21 cm experi- 
ments. Although there is considerable uncertainty in Xi at 
each redshift the model does lead to clearly preferred ranges. 
Including the z — 6 Lya forest point pushes the distribution 
to slightly lower values of Xi since it pushes the normalisa- 
tion of £ downwards. Within this parametrization, the IGM 
is highly ionized by z = 8 (at least to x% > 0.8). Note that 
since < Xi < 1 there is a "pile up" effect leading to a 
peak at x% = 1, where models leading to early reionization 
all contribute. 

4-3.2 Nion parametrization 

We now compare results from the iVion parametrization, 
analysed in the same as way as in £14.3. 1 1 Figure [6] shows 
constraints on the model parameters. These display a bi- 
modal distribution that arises because of the exclusion of 
models that do not ionize the Universe early enough. 

This model has significantly more flexibility than the £ 
parametrization, which shows itself in the distribution of Xi 
seen in Figure [7] The distributions are very broad at z = 10 
and z = 11 indicating that the data does a poor job in con- 
straining possible histories within this parametrization. It is 
important to note that even with this increased flexibility 
we still see that the IGM is largely ionized by z — 8. Planck 
level tcmb constraints lead to better localised distributions. 



4-3.3 Comparison 

In Figure [8] we give a sense of the best fitting histories pro- 
duced by the two different parametrizations and combina- 
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Figure 7. Distribution of x; at redshifts z = 8, 9, 10, and 11 for 
the 7Vi on parametrization. Same curve styles as for Figure [6] 



Figure 9. Marginalised redshift distribution of X((z) = 0.5 for the 
f (top panel) and N Ion (bottom panel) parametrizations. Same 
line conventions as for Figure [6] 
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Figure 8. Redshift evolution of Xi (top panel) and JV; on (bottom 
panel, in units of 10 51 s — 1 Mpc -3 ) for the best fitting history 
for the f (red curves) and N\ on (blue curves) parametrizations. 
Different combinations of data are plotted: Lya forest + WMAP5 
(solid), Lya forest (excluding 2 = 6) + WMAP5 (dotted), and 
Lya forest + Planck (dashed). Error bars show the constraints 
on JV; on from Table [3] 



tions of data. There is clearly considerable spread in the 
sorts of history that are consistent with observations. 

Two natural milestones in the reionization of the Uni- 
verse are the point at which Xi(z) — 0.5 and when reioniza- 
tion completes. This first point, Xi(z) = 0.5, is thought to 
lead to distinctive signatures in the 21 cm signal (jLidz et al.l 



1200 it ). In Figured we plot the PDF for finding x t = 0.5 
at a given redshift. Although there are differences in the 
spread of uncertainty between our two parametrizations, we 
see that in each panel the peak of the distribution lies around 
2 = 9 — 11. Existing data has a relatively strong preference 
for the Universe to be half ionized by z = 9. There is also 
a marked tail of the distribution at redshifts greater than 
this. 

The second milestone, the end of reionization, is a some- 
what slippery moment to define. One might define it as the 
point when bubbles percolate, so that x% = 1, or define it 
in terms of the point where the mean free path of photons 
grows to the horizon size, so that every gas element sees all 
sources. Moreover making predictions for the final few per- 
cent of the ionized fraction depends sensitively on the mean 
free path of the photons and the clumping. For the pur- 
poses of planning 21 cm instruments, the important moment 
is when the filling fraction of neutral regions drops to a few 
percent, so that the signal strength becomes small. In Figure 
1101 we plot the probability that x% > 0.95 at a given red- 
shift. This clearly indicates that, in both parametrizations, 
reionization is preferred to be mostly complete by z = 8. 
It is important to note that, while the predictions for the 
midpoint of reionization are driven almost entirely by the 
CMB, the Lycv forest provides very useful information in 
determining when reionization completed. 

It is clear from the preceding results that the A?i on 
parametrization has much greater flexibility in fitting the 
Lya forest data and so leads to more conservative re- 
sults. The £ parametrizations is quite restrictive in that 
the low redshift behaviour is very well constrained by the 
Lya forest. As a result, much of the tightness of the con- 
straints in the C, parametrization is driven by the form of 
the parametrization itself. This highlights the importance 
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Figure 10. Probability at a given redshift that Xi > 0.95 for the 
C (top panel) and iVi on (bottom panel) parametrizations. Same 
line conventions as for Figure [6] 



Figure 11. Confidence intervals at the 95% (dotted curves) and 
68% (dashed curves) levels and median history (solid curve) for 
the ionization history for the £ (top panel) and -/V; on (bottom 
panel) parametrizations using Lyc* forest and WMAP5. 



of choosing parametrizations capable of representing a wide 
variety of reionization histories with only a few parameters. 



4.4 21 cm fluctuations and observations 

4-4-1 Estimating the 21 cm power spectrum 

We now turn to making predictions for 21 cm experiments. 
Of particular interest is the ionization history. In the pre- 
vious section, we calculated the marginalised PDF of Xi in 
separate redshift bins. We use the same calculation to cal- 
culate constraints on Xi as a function of z using the cumu- 
lative probability to establish confidence intervals for the 
model. Figure [11] shows 68-% and 95-% confidence intervals 
for the ionization history calculated in our two models. It 
should not be surprising from our previous discussion that 
the constraints on Xi(z) are significantly less tight in the 
more flexible Mon parametrization. This is a strong indica- 
tion of how parametrization dependent our results are (note 
that using a power law for £ leads to similar constraints 
as a power law in Afi on ). Regardless of the parametrization, 
the Universe is likely to have been fully ionized by z — 8, 
p(xi{z = 8) > 0.99) > 0.5. 

Although having a constraint on the ionization history 
is useful, it is advantageous to map this into the expected 
amplitude of the 21 cm brightness temperature power spec- 
trum P2i- In order to achieve this mapping, we need two 
pieces of information: the evolution of the mean bright- 
ness temperature and a mapping between Xi(z) and the 
amplitude of the fluctuations on a given scale. Predict- 
ing the evolution of the mean brightness temperature re- 
quires, in addition to a knowledge of Xi(z), information 
about the IGM temperature and the Lycv background re- 
sponsible for coupling the spin and kinetic temperatures. 
There is considerable uncertainty in predicting these quanti- 



ties (see e.g. lFurlanettoll2006l : IPritchard fc Furlanettoll2007l : 
iPritchard fc Loebf 200S ) , so we will instead naively assume 
that the gas has been heated to Tk 3> Tcmb and that 
Ts — Tk as a result of strong Wouthysen-Field coupling. 
With these assumptions, we have 



T b = 27 x m 



0.023 



0.15 1 + z 
Q, m h? 10 



1/2 



mK. 



(6) 



To map a given Xi(z) into the desir ed P2i(k,z), we assume 
the validity of the analytic model of lFurlanetto et all (|2004h 
to calculate the power spectrum of the ionized component 
P xx . This model, based on the excursion set formalism, has 
been shown to provi de a good match t o numerical simula - 
tions of reionization l|Zahn et al.ll2007l ; fSantos et ail 120071 ). 
Moreover it has been shown that the spectrum of fluctua- 
tions can be well modelled given only xt; it is surprisingly 
robus t to other astrophysic al parameters once Xi has been 
fixed l|McQ uinn et alj|2007l ). We numerically build a lookup 
table specifying P21 as a function of z and Xi, which is then 
used to map our constraints on x% into constraints on P2i(fc), 
where we fix k — 0.1 Mpc^ 1 a scale roughly in the center of 
the range accessible to the first generation of 21 cm experi- 
ments. 

The sensitivity of the MWA to the 21cm power spec- 
trum will be limited by thermal noise in the antennae 
rather than by survey volume. At the relevant frequen- 
cies the thermal noise will be dominated by the galac- 
tic foreground, which is observed to have a temperature 
Tsky ~ 0.9(1 + z) 2 ' 6 K. The thermal noise for the observa- 
tions therefore rises steeply at high redshift, implying that 
better signal to noise will be obtained at lower redshifts. A 
useful outcome of our analysis is a prediction of the redshift 
where the MWA might be most likely to achieve the highest 
SN detection. The likelihood for a given redshift is propor- 
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Figure 12. Confidence intervals at the 95% (dotted curves) 
and 68% (dashed curves) levels and median history (solid curve) 
for A-r./Tgky for the £ (top panel) and Ni OIL (bottom panel) 
parametrizations using Lyc* forest and WMAP5. Solid red curves 
indicate the thermal noise for MWA. 



Figure 13. Confidence intervals at the 95% (dotted curves) 
and 68% (dashed curves) levels and median history (solid curve) 
for dxi/dz for the £ (top panel) and Ni on (bottom panel) 
parametrizations using Lya forest and WMAP5. 



tional to the amplitude of the PS divided by the square of 
the temperature of the sky. Thus, consideration of the sys- 
tem noise weights the sensitivity of the instruments to lower 
redshift ranges. Our results from this exercise are shown in 
Figure 1121 where we plot the power per logarithmic interval 
Ar b — \/k 3 P2i/2n 2 divided by T sky and for clarity in the 
scales have multiplied the resulting S/N by 10 5 . As expected 
the curves cut off at low z as the gas becomes fully ionized 
and all the curves tend towards the same asymptote at high 
z where the gas is fully neutral. We find that 21 cm experi- 
ments are likely to achieve the highest signal-to-noise ratio 
in the range z — 9 — 10. 

As an example of a 21 cm experiment, in Figure 1121 
we plot the expected ther mal noise for MWA ass uming the 
configuration described in iMcQuinn etafl (|2006l ), an inte- 
gration of 2000 hours on two places on the sky and a band- 
width of 6 MHz. We have allowed the collecting area of the 
dipoles to scale as A 2 , except above z — 8 where the area 
has been capped to reflect geometric shadowing within the 
antennae tiles. This clearly indicates that there is a good 
chance of MWA detecting the 21 cm signal over a wide range 
of redshifts. 



4-4-2 The global 21 cm signature 

Experiments sensitive to the global 21 cm signal will be most 
sensitive to sharp features in Xi(z) where the derivative of 
dxi/dz is large, since smooth modes are removed with the 
foregrounds. Converting the reionization history into the ex- 
pected signal is difficult, since it depends upon both the fore- 
grounds and the instrumental response, which modifies the 
shape of the foregrounds from a simple power-law. In Fig- 
ure[l3] we show the confidence intervals on dxt/dz. Since, in 



most of our histories, reionization takes place over Az > 2 
the derivative seldom becomes large. Since our reionization 
histories are monotonic the derivative is always negative. 
More detailed calculation, including the effects of gas heat- 
ing and Lya coupling, would show more complicated be- 
haviour. 

Since 21cm observations are able to measure the ioniza- 
tion fraction x% during the central parts of reionization (Lidz 
et al. 2007) , it is interesting to ask how such a measurement 
would aid our knowledge of reionization during other stages 
of the process. 



4-4-3 The impact of a 21 cm measurement 

In Figure 1141 we show the marginalised PDF for Xi in sev- 
eral redshift bins given a measurement of Xi(z = 9.5) = 
0.5±0.05. Here we chose a fiducial value consistent with that 
of our previous analysis and take an uncertain ty broadly 
consi stent with that quoted in the analysis of ( Li dz et"al1 
2007) for the MWA. We see that adding a direct measure- 
ment of Xi improves the constraints on the evolution of x% 
considerably. This feeds into the constraints on the model 
parameters and conceivably into our knowledge of the un- 
derlying astrophysical sources. 



4-4-4 Estimating tcmb from 21 cm measurements 

Finally, we consider a more speculative point. From the 
point of view of constraining fundamental cosmological pa- 
rameters, such as the tilt and running of the inflationary 
power spectrum, the optical depth to last scattering is a nui- 
sance parameter. If tcmb could be independently measured 
to high accuracy, so that it was in effect a known quantity, 
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Figure 14. Distribution of Xi at redshifts z = 8, 9, 10, and 11 
when 21 cm measurements are included. In each panel, we plot the 
distribution of the Q (black) and iV; on (red) parametrizations with 
(solid curves) and without (dotted curves) a 21 cm measurement 
of Xi(z = 9.5) = 0.5 ± 0.05. 
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Figure 15. Distribution of tcmb given 21 cm measurements of 
Xi(z = 8) > 0.8 and Xi(z = 9) = 0.5 ± 0.05 in combination 
with Lye? forest data for the £ (dotted curve) and 7V; on (dashed 
curve) parametrizations. For comparison, we plot the WMAP5 
constraint (solid curve) assuming Gaussian errors. 



then it could be removed from the uncertainty in CMB pa- 
rameter estimates. At least in principle, 21 cm experiments, 
by placing direct constraints on Xi in many redshift bins, 
could constrain tcmb directly. Whether this is possible in 
practice will be determined by how much of the ionization 
history where Xi > is accessible to 21 cm observations. 
Given that CMB constraints will approach ov CMB ~ 0.005 
most of the ionization history must be mapped for 21 cm 
constraints to be competitive. 

Figure [15] shows the errors on tcmb achieved within our 
modeling by adding two 21 cm measurements of Xi(z — 8) > 
0.8 and x%{z = 9) = 0.5 ± 0.05, similar to what might be 
achieved by first generation experiments, to the Lya for- 
est data. Clearly, the distributions are not competitive with 
the WMAP5 constraint unless the source evolution is quite 
constrained. Nonetheless constraints on tcmb from the di- 
rectly measured ionization history may provide a useful con- 
sistency check with the results from CMB experiments. 



5 CONCLUSIONS 

In this paper, we have developed a framework for combin- 
ing existing constraints on reionization from CMB and Lya 
forest observations, and used them to place bounds on the 
reionization history. Modeling of reionization is very uncer- 
tain, both in terms of the sources of radiation (their evo- 
lution, spectrum, clustering, etc.), and in terms of their 
interaction with the neutral IGM. Rather than employ a 
semi -analytic model, with all t he necessary assumptions 
(e.g. IChoudhurv fc Ferrardliooll ). we have therefore chosen 
a more general approach which utilises arbitrary forms for 
the evolution of ionizing emissivity. This approach avoids 
theoretical prejudice regarding the model for reionization 



at the greatest possible level. The framework that we have 
used here can easily be extended to incorporate more gen- 
eral parametrizations of the sources as improved data makes 
that worthwhile. 

The resulting analytic model for reionization and the 
Lya forest is described in Appendix [A] This approach 
remains model dependent. However the level of depen- 
dance has been quantified by considering two different 
parametrizations for the sources. These give consistent pre- 
dictions on Xi(z), albeit with differing levels of precision. 
Nevertheless, we are able to draw some relatively robust 
conclusions from our analysis. The main conclusion is that 
current observations suggest reionization was largely com- 
plete by z — 8 with the Universe likely to have been half 
ionized between z — 9 — 10. This result has important im- 
plications for upcoming 21 cm experiments. In particular 
we have shown that the signal-to-noise ratio for these ex- 
periments is most likely to be at its maximum in the range 
z — 9 — 10. Thus current observables of reionization can 
be used to guide the first attempts at 21cm observations, 
which will be very expensive in terms of observing time, 
and cover only a fraction of the reionization era at a time. 
We also demonstrate that following a 21cm derived measure- 
ment of the ionization fraction midway through the reioniza- 
tion era, the constraints on the entire history will be greatly 
enhanced. Finally, we speculate that future 21 cm experi- 
ments might begin to constrain tcmb sufficiently to provide 
an independent consistency check on CMB results. 
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APPENDIX A: MODELING REIONIZATION 

In this Appendix, we describe the details of our model for 
reionization and the framework that we use to connect differ- 
ent observables. We begin with our model for the Lya forest, 
which leads naturally into predictions for the evolution of 
the IGM neutral fraction and thus the CMB optical depth. 
This model makes a number of simplifying assumptions and 
should not be considered a true substitute to detailed nu- 
merical simulation. It does however provide a reasonable fit 
to existing observations and enables us to evaluate the large 
region of parameter space necessary for our analysis. 




A=(l+(5) 



Al Mapping r c g to T_i2 

We wish to convert the observed values of the Lya effective 
optical depth, which is defined via the fractional transmit- 
tance 



(Al) 



into values for F_i2 . To achieve this we m ake use of the fluc- 
tuating Gunn-Peterson approximat ion (| Weinberg fc et all 
1 19991 ; iFaucher-Giguere et all l2008bh . This provides a self- 
consistent model for the Lya forest, although it leads to 
parameter scalings t hat only approxima te more detailed nu- 
merical simulations (|Bolton et al.|[2005T) . 
Neglecting redshift-space distortions 



exp(- 



(A2) 



where, assuming photo-ionization equilibrium, the optical 
depth to Lya photons is given by 



7i"e 2 / Lya 1 R(T)nmin e 



(A3) 



in terms of the recombination rate R(T) and the metagalac- 
tic ionization rate per hydrogen atom T = F_i2 x 10~ 12 s _1 . 
For case A recombination, R(T) = RoT~ ' 7 , with Ro = 
4.2 x 10~ 13 cm 3 s _1 /(10 4 K)~ ' 7 . In order to apply this, we 
assume a power law relationship between the IGM temper- 
ature and density 



T = T A P 



(A4) 



with A = 1 + 8. The unknown quantities To and j3 represent 
a significant uncertainty in our interpretation of the Lya 
forest. We will consider different values in our analysis. 

The observations of t c b come from averaging over the 
spectra of many quasars. We therefore average the transmit- 
tance (specified by the above equations as a function of A) 
over a probability density function (PDF) for A to get 



(F)(z) = dAP(A;z)exp(-r) 



(A5) 



We make use of the analytic -P(A) from 



Figure Al. Top panel: The PDF for A = 1 + S from MHR00. 
Note the lognormal shape with a long power law tail at high A. 
This PDF has been shown to fail for A > 10 and is also suspect 
at very low A. Bottom panel: The kernel for calculating the mean 
transmittance {F). It peaks at A ~ 0.3, corresponding to slightly 
underdense regions. The calculation will be affected by errors in 
the PDF at low A. 



iMiralda-Escude et all (|200CJ). Sever al recent papers 

jPawlik et al.l |2009| ; bolton fc Becker! l2009h have shown 
that this PDF fails to match numerical simulations at high 
densities where the PDF has a power law tail. The integral 
for (F) obtains most of its contribution from low-density 
regions (see Figure lAljt . with non-zero transmittance. At 
lower z, where the mean density is lower, larger over- 
densities begin to contribu te. Using the analytic PDF of 
(jMiralda-Escude et a l. 2000) rath e r than the more accurate 
numerical fit of iBolton fc Beckerl (|2009l ) makes an ~ 10% 
difference in the conversion of r e ff into T_i2. In summary, 
given the above equations, a PDF for A, and knowledge 
of To and f3, we can map a given value of t c s to the 
corresponding value of r_i2. 

A2 Mapping from T_i2 to iVi on 

We next seek to relate F_i2 to the rate at which sources 
produce ionizing photons iVi on . The comoving ionization rate 
is 



dv e v \ v a v , 



(A6) 



in terms of the emissivity of sources e„ ~ i/"° s_1 , the mean 
free path \ v , and the ionization cross section a v ~ u~ 3 . 
The mean free path here introduces another source of con- 
siderable uncertainty. The frequency dependence of \ v de- 
pends upon the distribution of column density of absorb- 
ing systems, so that for a power law distribution of col- 



umn densities /(r) 



the mean free path scales as 



A y = (f/fo) 3 ' 7 1 ' A m f p (^o), where Ao is the mean free path 
at the Lyman limit . Taking these frequency scalings leads 
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to the relation 



Mo 



= 10° 



, /Q S \-i /os + 3(2_7) 
r - 12 W) ( 6 



AmfpO^C)) \ 1 fl + Z X 



40 Mpc 



7 



s 1 Mpc" 



(A7) 



(A8) 



Here as is the spectral index of the sources just above the 
HI ionization threshold, since the frequency dependence of 
the cross-section causes the contribution of higher energy 
photons to drop rapidly. This spectral index enters as a 
source of uncertainty in the overall normalization of the 
inferred Monand we allow for a range between as = 1 
and as = 3 spanning the ra nge expected from Population 
III stars l|Bromm et al.ll200ll) with hard spectr a through to 
softer galaxy spectra (e.g lLeitherer et al"1ll999t) . 

In the post- o verlap reionization model of 
iMiralda-Escude et all {2000), the mean free path of a 
photon is given by 



A mfp = A (l + z)[l - F V (A < A,) 



-2/3 



(A9) 



Here Fv(Ai) = J A> dAP(A) is the fraction of gas by vol- 
ume contained in regions with density below a critical den- 
sity for ionization A < Ai. In order to calculate A, and Ao 
we must model the Lya f orest in more detail, wh ich we do 
following the arguments of lFurlanetto fc Oh] (120051 ) . We may 
associate the column density of a Lyman limit system to the 
critical density by a ssuming that the characteristic size is the 
local Jeans length {Schaye 200ll) and that photoionization 
equilibrium holds. Thus, assuming that A/hi ~ xhiAuhLj 
gives 



/ a \ 3/2 /i i 7 \ 9/2 

A/ H i = 3.3 x 10 17 cm" 2 (A) T^YZ\ 2 (l±l 

(A10) 

Setting A/hi (A) w 1/ffo = 1.6 X 10 17 cm 2 gives the critical 
overdensity for a self-shielding clump as 



Ai « 49.5 



( T 



V10 4 K 



1 



7 



^2/3 



(All) 



We will be focusing on z > 4 where Ai < 150, as such our 
density PDF should be valid. At higher densities, we might 
expect the details of star formation to become important 
leading to an extra source of uncertainty. 

This prescription implies a distribution of column den- 
sities in the Lya forest 



d 2 N 
dN m dz 



(1 + z) 2 dA An/A ,3c(l-y) ar _! 
~^frTTT AP(A) -1— > x m N m \ 



H(z) 



dN B 



(A12) 

This model provides a reasonable description of the dis- 
tribution of column densities of absorbing systems and gives 
us a way of calculating the unknown absorption probability 
per unit length Aq" 1 . An estimate of the spacing between 
Lyman limit systems is 



Alls 



cH- 1 /dJVi 



(1 + *) 



dz 



(A13) 



The mean free path should be comparable to this spac- 
ing, but we need to account for the distribution of sys- 
tems with differing column density to incorporate the cu- 
mulative effect of lower column-density systems. For a pho- 
ton at the hydrogen ionization absorption edge, this gives 



l|Miralda-Escudell2003l ) 



/-dTT-^l-e-) 



Amfp /i°°drr tA l 



(A14) 



where we have defined r = A/ H i/(1.6 x 10 1B cm" 2 ). Using 
the value of 7 corresponding to 2 = 3 gives A m f p « Alls/2. 
Hence, there is a redshift dependent numerical factor of 
about 2 connecting Ao with the mean separation between Ly- 
man limi t systems. Although th i s mod el agrees well with the 
results of lMiralda-Escude et al.l l|2000l ). we should be wary of 
the possibility of uncertainties in the normalisation arising 
from the ambiguity of using the Jeans' length as the charac- 
teristic size of the objects. In our calculations, we will there- 
fore allow for a normalisation offset k = A m f p /A m fp !mo dei. 
Setting out the calculation in this way facilitates comparison 
with observati ons of the number of Lyman l imit systems per 
unit redshift l|Storrie-Lombardi et al.lll994l ) and the distri- 
butio n of systems of different column density l|Misawa et al.l 
120071 ). Using this model and observational input, we may 
convert observations of T e g into constraints on Ni on . The 
main sources of uncertainty arise from as and Ao and from 
the temperature-density relation. 



A3 Evolution of the neutral fraction 

Given Ni on , we calculate the HII region filling fraction Qhii 
using 



dt 



n H (0) 



QmiCn H (z)a A (T). 



(A15) 



where C = {^Hii)/( n Hii} 2 is the clumping of ionized hy- 
drogen. Note that we assume case-A recombination, this is 
appropriate since the majority of recombinations occur on 
the edge of self-shielding dense clumps which will trap the 
ionizing photons produced by recombination direct to the 
ground state preventing them from contributing to ioniza- 
tion in the IGM. 

Besides the sources, the crucial uncertainty in the evo- 
lution of the neutral fraction lies in the dependence of the re- 
combination rate upon the clumping C. This quantity is ex- 
pected to be redshift dependent and vary significantly from 
G < 1 at high redshifts if low density regions are ionized 
first, to C > 4 at low redshift, where the remaining neu- 
tral gas is located in overdense regions. This variation may 
have significant effect on the reionization history, since a 
high clumping factor may delay the completion of reioniza- 
tion by Az ~ 2. It is therefore important that we explore 
a range of values for the clumping and allow for its red- 
shift evolution. Note that the clumping will generally be a 
function of both red shift and neutral fraction. 

In the model of iMHROOl . the recombination rate in an 
inhomogeneous Universe is related to the uniform recombi- 
nation rate R u by 



R = R U dAP^(A)A 2 = CR U - 



(A16) 



By itself, this formalism would underestimate the effec- 
tive clumping, since reionization proceeds via the perco- 
lation of ionized bubbles, which are produced by highly 
biased sources. This may be accounted for by calculating 
the recombination rate inside each bubble and averaging 
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over the distribu t ion of bubble sizes. We do this following 
iFurlanetto fe Ohl l|2005l ). Accounting for the bubble distri- 
bution increases the clumping factor by a factor of ~ 4. 
This model predicts clumping factors that rise rapidly from 
values O < 1 early in reionization to values C ~ 10 as 
reionization comes close to completion and shows behaviour 
that is qualitative ly similar to that found in simulations (e.g. 
iTrac fe Cenll2007l ). Figure [3] shows an example of the evolu- 
tion of C from this calculation. 

One aspect that we ignore is the photoionization of 
mini-halos, which could consu me UV photons and th us de- 
lay reionization by Az ~ 1 liBarkana fc Loebl 12002). The 
clumping could also be affected by the heating associated 
with photoionization, which would tend to smooth out the 
gas and so reduce C l|Pawlik et al.l[2009l ). 

A4 CMB optical depth 

Once we have the evolution of the mean neutral fraction we 
can calculate the CMB optical depth directly. 

f z CMB fa 

tcmb = / dz — [1 + fn c (z)]Qmi(z)n H (z)(jT, (A17) 
Jo dz 

where or is the Thompson cross-section and /h c is a cor- 
rection due to the presence of helium and we assume that 
Hel is ionized at the same time as HI and that Hell reion- 
ization occurs at z — 3. A more detailed calculation would 
incorporate the covariance of the measured tcmb with other 
cosmological parameters, but that lies beyond the scope of 
this paper. 
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